The Extraction of Enriched Protein-Protein Interactions from Biomedical Text
نویسندگان
چکیده
There has been much recent interest in the extraction of PPIs (protein-protein interactions) from biomedical texts, but in order to assist with curation efforts, the PPIs must be enriched with further information of biological interest. This paper describes the implementation of a system to extract and enrich PPIs, developed and tested using an annotated corpus of biomedical texts, and employing both machine-learning and rulebased techniques.
منابع مشابه
Collection-Wide Extraction of Protein-Protein Interactions
Evidence in support of relationships among biomedical entities, such as protein-protein interactions, can be gathered from a multiplicity of sources. The larger the pool of evidence, the more likely a given interaction can be considered to be. In the context of biomedical text mining, this elementary observation can be translated into an approach that seeks to find in the literature all availab...
متن کاملThe ITI TXM Corpora: Tissue Expressions and Protein-Protein Interactions
We report on two large corpora of semantically annotated full-text biomedical research papers created in order to develop information extraction (IE) tools for the TXM project. Both corpora have been annotated with a range of entities (CellLine, Complex, DevelopmentalStage, Disease, DrugCompound, ExperimentalMethod, Fragment, Fusion, GOMOP, Gene, Modification, mRNAcDNA, Mutant, Protein, Tissue)...
متن کاملUsing Lexical Chaining to Rank Protein-Protein Interactions in Biomedical Text
Biomedical information extraction is becoming an increasingly important application of Computational Linguistics research. We propose a method for analyzing full-text articles on protein interactions that takes a discourse-based approach to provide a means of ranking the biological validity of such interactions. Specifically, we use lexical chaining—strings of semantically related words—as an i...
متن کاملMining relations from the biomedical literature
Text mining deals with the automated annotation of texts and the extraction of facts from textual data for subsequent analysis. Such texts range from short articles and abstracts to large documents, for instance web pages and scientific articles, but also include textual descriptions in otherwise structured databases. This thesis focuses on two key problems in biomedical text mining: relationsh...
متن کاملFrom Biomedical Literature to Knowledge: Mining Protein-Protein Interactions
To date, more than 16 million citations of published articles in biomedical domain are available in the MEDLINE database. These articles describe the new discoveries which accompany a tremendous development in biomedicine during the last decade. It is crucial for biomedical researchers to retrieve and mine some specific knowledge from the huge quantity of published articles with high efficiency...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2007